Connectionism coming of age: legacy and future challenges
نویسندگان
چکیده
ABOUT 50 YEARS AFTER THE INTRODUCTION OF THE PERCEPTRON AND SOME 25 YEARS AFTER THE INTRODUCTION OF PDP MODELS, WHERE ARE WE NOW? In 1986, Rumelhart and McClelland took the cognitive science community by storm with the Parallel Distributed Processing (PDP) framework. Rather than abstracting from the biological substrate as was sought by the “information processing” paradigms of the 1970s, connectionism, as it has come to be called, embraced it. An immediate appeal of the connectionist agenda was its aim: to construct at the algorithmic level models of cognition that were compatible with their implementation in the biological substrate. The PDP group argued that this could be achieved by turning to networks of artificial neurons, originally introduced by McCulloch and Pitts (1943) which the group showed were able to provide insights into a wide range of psychological domains, from categorization, to perception, to memory, to language. This work built on an earlier formulation by Rosenblatt (1958) who introduced a simple type of feed-forward neural network called the perceptron. Perceptrons were limited to solving simple linearlyseparable problems and although networks composed of perceptrons were known to be able to compute any Boolean function (including XOR, Minsky and Papert, 1969), there was no effective way of training such networks. In 1986, Rumelhart, Hinton and Williams introduced the back-propagation algorithm, providing an effective way of training multi-layered neural networks, which could easily learn non linearly-separable functions. In addition to providing the field with an effective learning algorithm, the PDP group published a series of demonstrations of how long standing questions in cognitive psychology could be elegantly solved using simple learning rules, distributed representations, and interactive processing. To take a classic example, consider the word-superiority effect, in which people can detect letters within a word faster than individual letters or letters within a non-word (Reicher, 1969). This result is difficult to square with serial “information-processing” theories of cognition that were dominant at the time (how could someone recognize “R” before “FRIEND” if recognizing the word required recognizing the letters?). Accounting for such findings demanded a framework which could naturally accommodate interactive processes within a bidirectional flow of information. The so-called “Interactive-activation model” (McClelland and Rumelhart, 1981) provided just such a framework. The connectionist paradigm was not without its critics. The principal critiques can be divided into three classes. First, some neuroscientists (Crick, 1989) questioned the biological plausibility of backpropagation, when they failed to observe experimentally complex and differentiated back-propagating signals that are required to learn in multi-layered neural networks. A second critique concerned stability-plasticity of the learned representations in these models. Some phenomena require the ability to rapidly learn new information, but sometimes newly learned knowledge overwrites previously learned information (catastrophic interference; McCloskey and Cohen, 1989). Third, representing spatial and temporal invariance—something that apparently came easily to people—was difficult for models, e.g., recognizing that the letter “T” in “TOM” was the “same” as the “T” in “POT.” This invariance problem was typically solved by multiplying a large number of hard-wired units that were spaceor time-locked (see e.g., McClelland and Elman, 1986). Finally, critics pointed out that the networks were incapable of learning true rules on which a number of human behavioral, namely language-learning was thought to depend (e.g., Marcus, 2003; cf. Fodor and Pylyshyn, 1988; Seidenberg, 1999). The connectionist approach has embraced these challenges: Although some connectionist models continue to rely on backpropagation, others have moved to more biologically realistic learning rules (Giese and Poggio, 2003; Masquelier and Thorpe, 2007). Far from being a critical flaw of connectionism, the phenomenon of catastrophic interference (Mermillod et al., 2013) proved to be a feature that led to the development of complementary learning systems (McClelland et al., 1995). Progress has also been made on the invariance problem. For example, within the speech domain representing the similarity between similar speech sounds regardless of their location within a word has been addressed in the past by Grossberg and Myers (2000) and Norris (1994) and this issue presents a new more streamlined and computationally efficient model (Hannagan et al., 2013). An especially powerful approach to solving the location invariance problem in the visual domain is presented by Di Bono and Zorzi (2013), also in this issue.
منابع مشابه
Whither Activation Policies? Reflections for the Future
Whither Activation Policies? Reflections for the Future* Activation policies aimed at getting working-age people off benefits and into work have been embraced by many OECD countries. In a previous paper, I have argued that activation strategies have performed well during the Great Recession and subsequent recovery in some, but not all, of these countries. At the same time it is pertinent to loo...
متن کاملThe Trend of Age Distribution in WHO’s EMRO countries during the Last Three Decades (1980-2010) and its Future Health Policy Requirements
Introduction: Investigation trend of age distribution can be helpful for policymakers to predict the potential challenges of health care systems in order to improve responsibilities to future health needs. Therefore the aim of the present study was to investigate the trend of age distribution in the EMRO region in order to determine future health policy requirements. <str...
متن کاملFewer Epistemological Challenges for Connectionism
Seventeen years ago, John McCarthy wrote the note Epistemological challenges for connectionism as a response to Paul Smolensky’s paper On the proper treatment of connectionism. I will discuss the extent to which the four key challenges put forward by McCarthy have been solved, and what are the new challenges ahead. I argue that there are fewer epistemological challenges for connectionism, but p...
متن کاملSmall Hydro-Power Plants in Kenya: A Review of Status, Challenges and Future Prospects
Small Hydro-power Plants (SHP) are an important source of electricity in many countries. However, little is known about SHP in Kenya. This paper reviews the status, challenges in implementation of SHP and prospects for future development of SHP in Kenya. The paper shows that SHP has not yet fully utilized the available hydro-power potential. The challenges associated with SHP development should...
متن کاملThe Promise of Distributed Computing and the Challenges of Legacy Information Systems
The imminent combination of computing and telecommunications is leading to a compelling vision of world-wide computing. The vision is described in terms of next generation computing architectures, called Enterprise Information Architectures, and next generation information systems, called Intelligent and Cooperative Information Systems. Basic research directions and challenges are described as ...
متن کاملVolume1 referees
Following is a list of our colleagues who volunteered to review the manuscripts received to Geopersia during the past 12 months. I appreciate the positive actions that you have taken to participate in reviewing the manuscripts and implementing our recommendations to get the two issues published. We certainly would not be able to continue without your support and I wish to acknowledge your con...
متن کامل